Neuro-dynamic Programming for the Exploration of Unknown Graphs
نویسندگان
چکیده
In this paper, the problem of exploring stochastic graphs is addressed. The definition of the entropy related to the a-priori unknown parameters (the lengths of the a-priori unknown links) leads to the formulation of the problem as a stochastic optimal control one. The application of exact Dynamic Programming suffers the so-called curse of dimensionality. To overcome this drawback, an approximate technique is proposed making use of Neuro-Dynamic Programming. Exploiting the concept of frontier, any approximate solution of the problem is shown to generate a “proper” policy. Copyright c © 2005 IFAC
منابع مشابه
A Neuro-Fuzzy Model for a Dynamic Prediction of Milk Ultrafiltration Flux and Resistance
A neuro-fuzzy modeling tool (ANFIS) has been used to dynamically model cross flow ultrafiltration of milk. It aims to predict permeate flux and total hydraulic resistance as a function of transmembrane pressure, pH, temperature, fat, molecular weight cut off, and processing time. Dynamic modeling of ultrafiltration performance of colloidal systems (such as milk) is very important for design...
متن کاملAn Integer Programming Model and a Tabu Search Algorithm to Generate α-labeling of Special Classes of Quadratic Graphs
First, an integer programming model is proposed to find an α-labeling for quadratic graphs. Then, a Tabu search algorithm is developed to solve large scale problems. The proposed approach can generate α-labeling for special classes of quadratic graphs, not previously reported in the literature. Then, the main theorem of the paper is presented. We show how a problem in graph theory c...
متن کاملNeuro-Fuzzy Based Algorithm for Online Dynamic Voltage Stability Status Prediction Using Wide-Area Phasor Measurements
In this paper, a novel neuro-fuzzy based method combined with a feature selection technique is proposed for online dynamic voltage stability status prediction of power system. This technique uses synchronized phasors measured by phasor measurement units (PMUs) in a wide-area measurement system. In order to minimize the number of neuro-fuzzy inputs, training time and complication of neuro-fuzzy ...
متن کاملAdaptive Neuro-Fuzzy Inference System application for hydrothermal alteration mapping using ASTER data
The main problem associated with the traditional approach to image classification for the mapping of hydrothermal alteration is that materials not associated with hydrothermal alteration may be erroneously classified as hydrothermally altered due to the similar spectral properties of altered and unaltered minerals. The major objective of this paper is to investigate the potential of a neuro-fuz...
متن کاملAdaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence
In this paper, a novel neuro-optimal control scheme is proposed for unknown nonaffine nonlinear discretetime systems by using adaptive dynamic programming (ADP) method. A neuro identifier is established by established RNN model, the ADP method is utilized to design the approximate optimal controller. Two neural networks (NNs) are used to implement the iterative algorithm. The convergence of the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005